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Abstract 



Various adaptive randomization procedures (adaptive designs) have been 
proposed to clinical trials. This paper discusses several broad families of proce- 
dures, such as the play-the-winner rule and Markov chain model, randomized 
play-the-winner rule and urn models, drop-the-loser rule, doubly biased coin 
adaptive design. Asymptotic theories are presented with several pivotal proofs. 
The effect of delayed responses, the power and variability comparison of these 
designs are also discussed. 

1 Introduction. 

As reported by the World Health Organization (Global Summary of the AIDS Epi- 
demic, December 2006), the estimated number of people living with HIV is 39.5 
million, causing 2.9 million deaths in 2006 and 13% are children under 15 years. The 
alarming magnitude of AIDS epidemic and outbreaks of other fetal contagious dis- 
eases such as SARS reveal how vulnerable our health care system is. In order to 
search for more effective treatments, efficient clinical studies are urgently needed. In 
clinical trials, the traditional balanced (or 50%-50%) treatment allocation rule been 
challenged due to its possible unethical consequences. A frequently quoted clinical 
trial is the study of the drug AZT in reducing risk of maternal-infract HIV transmis- 
sion. While half of the pregnant women (239) are given the AZT drug, the remaining 
mothers (238) receive the placebo when 50%-50% allocation scheme is used. Only 20 
infants are HIV-positive in AZT group and 60 in the placebo group, (c.f., Connor et 
al., New England J. Medicine, 1994). Balanced allocation resulted in many failures 
in the placebo group. 

Yao and Wei (1996) redesigned the AZT trial using an adaptive allocation rule, the 
randomized play-the- wiener rule proposed by Wei and Durham (1978), and showed a 
reduction of several treatment failures under adaptive allocation. Adaptive designs, 
an important subdivision of experimental designs nowadays, are allocation rules in 
which the probability a treatment assigned to the coming patient depends upon the 
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results of the previous patients in the study. The basic goal is to skew allocation 
probabilities to favor better treatment performance. 

Early important work on adaptive designs was carried out by Thompson (1933) 
and Robbins (1952). Since then, a steady stream of research (Zelen (1969), Wei and 
Durham (1978), Wei (1979), Eisele and Woodroofe (1995), etc) in this area has gener- 
ated various treatment allocation schemes for clinical trials. This paper provides the 
recent theories of several broad families of designs. In Section [2J we state some limit 
results on martingale, which are the basic tools to derive the asymptotic properties 
of adaptive designs. In Section [3j we consider Zelen's play-the-winner rule and its 
generations by Lin, et al (2003). In Section HJ we derive the asymptotic properties of 
the adaptive designs based on urn models, a large family of randomization procedures. 
In Section El the drop-the-loser rule is introduced. In Section 0, an important family 
of target-driven designs, the doubly adaptive biased coin designs, are discussed. In 
Section [71 the effect of the delay of treatment results is discussed. In Section [HI we 
compare the variabilities of different type of adaptive designs. The lower bound of 
the asymptotic variability for a pre-specified allocation proportion is established, and 
asymptotic best adaptive designs are provided. Finally, further discussion and future 
topics are mentioned in Section O For the convenience of reading, we also give several 
pivotal proofs. The principle ideas of deriving asymptotic properties can be found 
from these proofs. Those who are not interested in the theoretical results can skip 
these proofs and go quickly to the last two sections. 

The following notations and definitions are introduced to describe the random- 
ized treatment allocation schemes. Given a clinical trial with K treatments. Let 
Xi,X2,... be the sequence of random treatment assignments. For the m-th sub- 
ject, X m = (X m> i, . . . ,X m ^K) represents the assignment of treatment such that if the 
m-th subject is allocated to treatment k, then all elements in X m are except for 
the k-th component, X mik , which is 1. Let N„ ;k be the number of subjects assigned 
to treatment k in the first n assignments and write N n = (N n< i, . . . ,N n ^)- Then 
N n = Ylm=i X m - We are interested in the statistical behavior of proportions N n> k/n, 
k=l,...,K. 

2 Preliminaries, limit theorems on martingales. 

The martingale approach is the basic tool to investigate the asymptotic properties of 
adaptive designs. In this section, we state some limit theorems on martingales. For 
more results, one can refer to Hall and Heyde (1980) and Stout (1974) or other text 
books. Let {M n ,T n \n > 1} be a real martingale sequence with AM n = M n — M n _i 
being it difference. 

Theorem A (LLN) Let rj n > be a sequence of random variables such 7] n is T n -\ 
measurable and r\ n /* a.s., and p is a real number in (0, 2]. 

(a) Then with probability one, M n /r] n — > on the event {rj n — ► oo, Y^m=i ^\\^^m\ v \^F m -i\ / Vm < 
oo}. 

(b) IfJ2m=i E[{^-^m) 2 \^Fm-i\ < Vn a -S-, then with probability one M n /r) n — > on 
the event {r]„ — > oo}. 



Theorem B (LIL) Suppose sup m £[|AM m | p ] < oo for some p > 2, then M n = 
0(y/n log logn) a.s. 

Theorem C (CUT ) Suppose there is a constant a > such that Y2m=i E[{^M m ) 2 \J r m ^i 
a 2 . Further assume that the conditional Lindberg condition 

1 n 

- E[(AM m ) 2 I{\AM m \ > eV^|^-i] ^ 0, Ve > 
n 

is satisfied. Then M n /y/n ^ N(0,a 2 ). Further, M [nt] /y/n ^ aW{t) in D[0,oo), 
where W(t) is a standard Brownian motion. 

Theorem D fSkorokhod embedding theorem ) In a possibly enlarged probability space 
in which there is a standard motion B(t), we can redefine the martingale sequence 
{Mn^n} without changing its distribution and define a non-decreasing sequence of 
random variables {r„} such thatr n is T n measurable, M n = B(r n ) and E[Ar m |jF m _ 1 ] = 
£[(AM m ) 2 |^ m -i] a.s., EflA^n^i] < C p E[\AM m \ 2 P \F m _ x \ a.s. for p > 1. 

Theorem E (^Strong approximation^) Let {M n , J-" n ;> 1} be a martingale sequence 
in £% d space, S n = Ylm=i E[(AM m )'AM m \J : ' m -.i]. Suppose that there exists constants 
< e < 1 such that 

oo 

£[||AM n || 2 /{||AM n || 2 > n^F^/n 1 -* < oo a.s., (2.1) 

n=l 

and that T is a covariance matrix which is measurable with respect to jF fc for some 
k > 0. Then for any 5 > 0, (possibly in an enlarged probability space with {M n } being 
redefined) there exist k > and a d-dimensional standard Brownin W(t), independent 
ofT, such that 

S n - W(n)T 1/2 = 0(n 1/2 ~ K ) + 0(a l J 2+s ) a.s. 

Here a n = max m < n ||S m - mT\\ . 

If sup n E|| AM n \\ 2+s ° < oo for some 6~ > 0, then the condition (12. ip is satisfied. 
The proofs of Theorem A (a), Theorem C and Theorem D can be found in Hall and 
Heyde (1980). The proof of Theorem A (b) can be found in Stout (1974). Theorem B 
can be proved by Theorem D and the LIL of a Brownian motion. The proof of Theorem 
E is given in Zhang (2004). Also, Theorems A-C remain true for martingales in a M d 
space with some necessary notations changed. 

3 Play-the- winner rule and Markov chain adaptive 
design 

Consider a two-arm clinical trial: two treatments (1 and 2) with dichotomous response 
(success and failure). Patients (subjects) are recruited into the clinical trial sequen- 
tially and respond immediately to treatments. Zelen (1969) proposed the following 



design, which is well known as the play-the-winner (PW) rule: A success on a partic- 
ular treatment generates a future trial on the same treatment with a new patient. A 
failure on a treatment generates a future trial on the alternate treatment. Let pi be 
the success probability of a patient on the treatment i, qi — 1 — Pi, i — 1, 2. Then 

► : and y/n( ■ ) -> N[0,a PW ), 

n qi + q 2 n q x + q 2 

where o% w = qiq 2 (pi + P 2 )/(qi + q 2 ) 3 - Notice that q 2 /(qi + q 2 ) > 1/2 if p x > p 2 . So, 
if treatment 1 is "doing better", the PW rule favors treatment 1. 

Lin, Bai, Chen and Hu (2003) extended the PW rule to a general Markov Chain 
adaptive design. Suppose that at the stage m, the treatment 1 is assigned to the mth 
patient. Then the (m + l)th patient will be assigned either treatment 1 or treatment 
2 according certain probabilities, which depend on the response of the mth patient. 
Let a s be the probability of assigning the (m + l)th patient to treatment 1, when the 
response of the mth patient to treatment 1 is "success", and let a/ be the probability 
of assigning the (m+l)th patient to treatment 1, when the response of the mth patient 
to treatment 1 is "failure". Similarly define f3 s and j3f with treatment 2 instead of 
treatment 1 in the definitions of a s and af. When a s — 1, a/ = 0, (3 S — 1, /?/ = 0, 
we get Zelen's PW rule. A clinical application disadvantage of the PW rule is that 
it is fully deterministic, i.e., when the previous results are known, the assignment of 
the next subject is fully determined. The Markov chain adaptive design is not fully 
deterministic except when the parameter a s , ctf, (3 S and (3f take extreme values and 
1. When taking a s = atf = /3 S = /3/ = 1/2, we get the fully randomization procedure 
which allocates patients to each treatment with a probability 1/2. The more are 
the parameters near extreme values, the more is the procedure being deterministic. 
When a s < af and /3 S < j3f, the Markov chain adaptive design is less ethical than the 
balanced allocation (c.f., Equation ()3.ip ). The parameters a s , af, (3 S and (3f can be 
chosen to reflect the trade-off between the degree of randomness and ethic. 

Let pi(m) = P{success \X m> i = 1} and p 2 {m) = Pjsuccess \X m> i = 0} and 
a m = pi(m)a s + (1 - pi(m))a/, (3 m = p 2 {m)(3 s + (1 - p 2 (m))(3 f . Then {X mA } is a 
Markov chain with the transition probability matrix 

ct m 1 OLfi 

1 Pm fim 

When pi(m) = p\ and p 2 (m) = p 2 for all m, a n = a, (3 n = j3, and {X ml } is 
a homogeneous Markov chain with a stationary distribution (/i, 1 — fi), where fi = 
(1 — /3)/(2 — a — /3). Following from the central limit theorem for Markov chains we 
have that 

- (j, and Vn( — ' /i) — > N(0, a ) (3.1) 



n n 



where 



^2 yu r r Y U9 f r- , Jy v x (1 - a)(l - (3)(a + (3) 
a = Var^{X l5l } + 2 ^ Cov^X^, X jtl } = _ - _ . 

J— 2 



For non-homogeneous case, Lin, Bai, Chen and Hu ( 2003) proved (13.11) under the 
condition that 

\pi(m) -pi\ + Mm) -p 2 \ 



E 



jvn 

m=l 



< OO. 



Lin, Zhang, Cheung and Chan (2005) established the strong approximation for 
N n i, from which (13.11) follows immediately. 

Theorem 3.1 In a possibly enlarged probability space, we can redefine the sequence 
{^n,i} without changing its distribution, such that 

N n> i -nfi- aW{n) = O ((ra log logra) 1/4 (logra) 1/2 ) + 0(A n ) a.s., 

where {W(t)} is a standard Brownian motion and A n = z~2m=i(\ a m ~ a \ + \Pm — P\) ■ 

Proof. Write Am m = X m l — E[X mj i|.F m _i], where J- m -\ is the history sigma field 
generated by X m>1 , . . . , X m _ 1)X . Then X m = Am m + E[X m>1 \ f ra _i] = Am m + 1 -@ m - 
{l-a m -(3 m )X m _ hl . It follows that N n> i = m n +l-f3-{l-a-(3)N n ^ l +0{A Tl )+0{l). 
So, 

iV" ni i = n/i + m n /(2 -a- 13) + 0(A n ). 

For the martingale m n , we have E[(Am m ) 2 | F m -x\ = (1 - /3 m )/3 m + (a m - f3 m )(l - a m - 
/3 TO )X m _ 11 and |Am m | < 1. It follows that 
it 

E[(Am m ) 2 | T n 



m— II 



m=l 



=n(l -p)P+{a-p){\-a- (3)N n _ 1A + 0{A n ) 

=n(l - /3)/3 + n(a - /3)(1 - a - + ( " ~f )(1 - - ''~ P) m n + 0(A n ) 

2 — a — p 

(l-a)(l-/3)(a + /3) , , ^. ^— . 

=n v H± + + ( V / nloglogn) 

z — a — p 

=.na 2 M + 0(A n ) + 0(a/™ log logra) a.s. 

due to the LIL (Theorem B). Applying the Skorokhod embedding theorem (Theo- 
rem D), we can write m m = B{j m ). Notice J2m=i {^ T m ~ E[AT m |.F m _i]) is also a 
martingale sequence. By the LIL, we conclude that 



r n = 7^ E[Ar m | F m -i) + 0(y/n log logra) 

m=l 
n 

= ^ E[(Am m ) 2 | T m ^ + 0(V" log log n) 



m=l 



~-na 2 M + 0(A n ) + 0(^n log logra) a.s. 



It follows that 



m n =B(r n ) = B(na 2 M ) + o(\J (A n + ^ralog logra) logra, 
=B(na 2 M ) + 0(A n ) + 0((ra log lograf/^logra) 1 / 2 ) a.s. 



by the sample properties of a Brownian motion (c.f., Csorgo and Revsz (1980)). The 
proof is now completed by letting W(t) = B^taj^) / o~m and noticing that a 2 M /(2 — a — 
(3) 2 = a 2 . □ . 

For the multi-treatment case, we let Pi(m) = P(success \X m ^ = 1}. Assume the 
transition probability matrix of the Markov chain {X n } is H n = {Hij(n)} which is 
a function of Pi(n), i = 1, . . . ,K, i.e, E[X n+ i|X n ] = X n H n . By using Theorem E 
instead of the Skorokhod embedding theorem, Zhang (2004) showed that N n can be 
approximated by a mult i- dimensional Browian motion: 

n 

N n -nv- W{n)H = o(n 1/2 - K ) +0(J2\\ H k~ H\\) a.s., 

k=l 

where k > 0, v = (vi, ■ ■ ■ ,vk) is the left eigenvector corresponding to the largest 
eigenvalue of H with vi + . . -+vk = 1, {^(£)} is a ^-dimensional standard Brownian 
motion. In particular, we have asymptotic normality, if 

n 

J2\\H k -H\\ =o{n 1 ' 2 ). (3.2) 
k=i 

Zhang (2006) studied a kind of non-humongous Markov chain designs, in which H n 
is a function of an estimated unknown parameters 6. In this case the condition (13. 2p 
is not satisfied. 



4 Randomized play-the- winner rule and urn mod- 
els 

To overcome the drawback that the PW rule is fully deterministic, Wei and Durham 
(1978) introduced the following randomized play-the-winner (RPW) rule: We start 
with {01,(3) balls (type 1 and 2 respectively) in the urn. If a type k ball is drawn, a 
patient is assigned to the treatment k, k = 1,2. The ball is replaced and the patient's 
response is observed. A success on the treatment 1 or a failure on the treatment 2 
generates a type 1 ball in the urn; A success on the treatment 2 or a failure on the 
treatment 1 generates a type 2 ball in the urn. Let Y n l (Y n>2 ) be number of balls of 
type 1 (2) after n stage. From the results of Athreya and Karlin (1968), we have 

Y n ,i <? 2 , N U;1 q 2 

a.s. and > a.s.. 



Y nA + F„, 2 Qi + <?2 n qi + q 2 

The limiting proportion is the same as that of the PW rule. We refer to it as urn 
proportion. When pi + p 2 < 1-5 (or qi + q 2 > 0.5), we have the following asymptotic 
normality: 

/-, Y n> i q 2 v (M2 \ 

Vn{ Y nA + Y n , 2 qi + q 2 '^ [ ' {2{q 1 + q 2 )-l){q 1 + q 2 ) 2 > 



and y/n(N njl /n - q 2 /(qi + 92)) -> N(0,<7jj PW ), where 

2 gig2[5-2(gi + g 2 )] 

^ " [2(g 1 + g 2 )-l](g 1 + g 2 ) 2 - (41J 

The asymptotic normality was first given in Smythe and Rosenberger (1995). When 
9i + 92 < 0.5, the limiting distributions of both the urn composition and the allocation 
proportion are unknown. The RPW rule has the same limiting allocation proportion 
as the PW rule. But the asymptotic variability is much larger. 

As multi-treatment extensions of the RPW rule, one large family of randomized 
adaptive designs can be developed from the generalized Polya urn (GPU) model. Urn 
models have also long been recognized as valuable mathematical apparatus in many 
areas including physical science, biological science, engineering, information science, 
the study of economic behaviors, etc. 

Consider an urn containing balls of K types. Initially, the urn contains Y = 
(Y 0j i, • • • , Yq^k) balls, where Y"o,fc denotes the number of balls of type k, k — 1, • • • , K. 
A ball is drawn at random. Its type is observed and the ball is then replaced. At the 
mth stage, following a type k drawn, D k j(m) (> 0) balls of type j, for j — 1, • • • , K, 
are added to the urn. D^{m) is a random function of the response of the m-th 
subject on treatment k. The expectation of the total numbers of balls added in each 
stage is assumed to be the same (say 7), so 

K 

^2 ^{Dkjim^Fm-i} = 7, fc, j = 1, • • • ,K,m = 1,2, • • • , 
i=i 

where T m -\ is the history sigma field. Without loss generality, we can assume 7 = 1. 
Let H m be the matrix comprising element {hij(m) = E[Dkj(m)\J : ' m -i\} and D rn = 
{Dkj(m)}. We refer to D m as the adding rules and H m as the design matrices. If 
D m , m = 1, 2, . . . , are i.i.d., then H m = H for all n. In this case the model is said to 
be homogenous. In general, it is assumed that H m — > H. Let Y n = (l^,i, • • • , Y nt x), 
where Y n>k represents the number of balls in the urn of type k after nth stage. And 
let v — (v 1, • • • , vk) be the left eigenvector corresponding to the largest eigenvalue of 
H with v 1 + . . . + vk — 1- Then v k is just the limiting proportion of both the patients 
assigned to treatment k and the type k balls in the urn. Bai, Hu and Zhang (2002), Hu 
and Zhang (2001) obtained the asymptotic properties via the strong approximation. 

Theorem 4.1 Suppose that {D m } is sequence of i.i.d. random matrices with sup m E||D m || 
00. Let Ai = 7 = 1, A 2 , . . . , \k be the eigenvalues of H, and A = max{i?e(A 2 ), . . . , i?e(A^)} 
If X < I, then 

N ■ Y 



Vi a.s. and — ^ > Vi a.s. (4.2) 

n T K Y ■ 



If\< 1/2, then 



^(IH _ v ) 2> M0, S) and ~ v) % N(0, S*). (4.3) 

n n 



Proof. Write 1 = (1, . . . , 1), \Y n \ = £f =1 Y nk , H = H-l'v, M n = £^ =1 X m {D m - 
H), m m == J2m=i ( X ™ - ^[Xml^m^x]) . Then HI' = 1', m n l' = 0, and the eigen- 
values of H are 0, A2, . . . , Xr- We have the following lemma on matrices, the proof of 
which can be founded in Hu and Zhang (2004a). 

Lemma 4.1 If AQ n = AP n + Q n _iH/(n -l),n> 2, then 

\\Q n \\ = 0(\\P n \\) + J2 ° (l|jPmll) (n/m) A log- 1 (n/m), 

m=l 

where v is the degree of the second largest eigenvalue of H . 
We prove (14.31) only. Notice 

A(Y n - nv) = X n D n -v = AM n + Am n H + t^tH - v 

I *n-l\ 

= Y n-l-( n ~ l ) V H + + Amn H 

n — 1 

+ (44) 

Multiplying 1' yields \Y n \ —n = AM n l' + \ Yq\ = 0(y/n log logra) a.s. due to the LIL 
(Theorem B). It follows that 

M n + m n H f-j^r -v\H= 0(^n\og\ogn) a.s. 

By applying Lemma |4~T1 and noticing A < 1/2, we obtain Y n —nv = 0(y/n log logn) a.s. 
Write D[ k) = (D kl , . . .,D kK ), Si = diag(v) - v'v and S 2 = £f =1 «*Var{D? } }. For 
the martingale (Af n , m n ), we have sup n E|| AM n \\ 2+s < 00, ||Am„|| < K, E[(AM" n )'Am n |jF, 
and 

~Y rrii \ 1 ~Y ri -i -1 



m— 11 



E|"(Am m ) / Am m |^ r m _il = ^[di a# (jrrr) - (tttt) 



m=l m=l 



Si + 0{\Jn log log ) a.s. 



n A' 



m=l m=l k=l 



Ef(AM m )'AM m | F m _^ = Yl Va r i D i } }Wl^m| 

1=1 fc=i 

S2 + 0{\Jn log log n) a.s. 



By the strong approximation (Theorem E), there are two independent d-dimensional 
standard Brownian motions Wi(t) and W^t) such that for some k > 0, 

m n = Wi(n)S| /2 + o(n 1/2 ~ K ) a.s., M n = W 2 {n)T l l J 2 + o(n 1/2 ~ K ) a.s. (4.5) 



Without loss generality, we assume 1/2 — k > A and 1/2 — k> 1/4. Let Gj(t) be the 
solution of the equation 



G?i(t)= / ^Mjfrfx + W.^S^ 2 , G l (0)=0, (4.6) 
Jo x 

i = 1,2. Notice {[Wi(T(- + s)) - W^T-)]/^} = {Wi(-)}- Tt can be checked that 
Gi(t) is a Gaussian process with stationary increments and Var{Gj(t)} = tVar{Gj(l)}, 
and 

f = y 9M. + 0((logn) 3 / 2 ) a.s. (4.7) 

Combing (OI)- (H~7]) yields 

^ n r \ n r \tt ^Y m -mv-G 2 {m)-Gi(m)H~ 
Y n -nv - G 2 (n) - G x {n)H = > H + P n , 

m=l 

where 

P n = 0((logn) 3 / 2 ) + o{n^-«) + £ (l - (|^| -v)H 

n— 1 2 

=0((logn) 3/2 ) + o(n 1/2 ~ K ) + ^ (V(log log m)/m) = o(n 1/2 - K ) a.s. 

m=l 

Now, by applying Lemma [4. II we conclude that 

Y n - nv = G 2 (n) + G x {n)H + o(n 1/2 ~ K ) a.s. 

Finally, 

n-l y n-1 X y X 

N n - nv =m n + ^2 tttt - ™ v = m « + ( r\F7 ~~ V I ~ 1 

n W n V Wl / 

m=0 m=u ' 

n-l 

=m„+> (J-l«) 



m 

m=l 



m=l 

n—1 



+ _ G 2 (m) + Gi{m)H _ + /2 

m=l 







=Gi(n)+ r^^-dx{I-l'v)+o{n l l 2 - K ) a.s. 
Jo x 



(14. 3 p is now proved, where X and S* are the variance-covariance matrices of normal 
random variables (?2(1) + Gi{l)H and Gi(l) + f Q G 2 (x) /x dx (I — l'v), respectively. 
For the details of specifying and estimating the variance-covariance matrices, one can 
refer to Bai and Hu (2005), Hu and Zhang (2004b), Zhang, Hu and Cheung (2006). 
□ 

The interested one can check that for the RPW rule, the equation (14.61) reduces to 

G hl (t) = A f ^^-dx + aW®, G itl (0) = 0, 
Jo x 

Gi^{t) = —Gi,\{t), and the solution is G i; i = o~it x x~ x dWi(x), i = 1,2, where A = 
l-<?i-<?2, o\ = v x v 2 = q^/iqi+Qif and<r| = v 1 p 1 q 1 +v 2 p 2 q2 = <?i<?2G»i+P2) /(?i+£>)- 
When only the second moment of D m is assumed to be finite, we can show (14. 3[) 
with a similar argument by applying the weak convergence of martingales (Theorem 
C) instead of the strong approximation. For details one can refer to Hu and Zhang 
(2001). Janson (2004) studied the urn models by embedding them to continuous 
branching processes and established the asymptotic normality in a different way. 

Example 4.1 As a multi-treatment extension of the RPW rule, Wei (1979) proposed 
a GUP to allocate subjects, in which the urn is updated in the following way: at the nth 
stage, if a subject is assigned to treatment k and cured, then a type k ball is added to the 
urn, otherwise, if treatment k for a subject fails, then -^-j- balls are added to the urn 
for each of the other K — 1 treatments. In this urn model, H = {h^, k, j = 1, . . . , k}, 
where h kk = p k and h k j = q k j{K — 1) (j ^ k), and p k is the successful probability of 
treatment k, q k = 1 -p k . So v k = (l/q K )/ Y,f=i0-/Qj)- 

For the non-homogenous case, Bai and Hu (2005) obtained (14. 3 p under the condi- 
tion that 

00 II M — TT\\ 

y \\*im _«| < ^ a s (4 g) 

This condition can be weakened to Y^n=i \\H m — H\\ = o(^fn) a.s. by use the argu- 
ment in the above proof. An applicable class of non-homogenous urn models is the 
sequential estimation-adjusted urn (SEU) model, in which the urn is updated accord- 
ing to the current response and the current estimate of an unknown parameter, and so 
H m = H(6 m ) is a function of the estimator. In this case, the fastest convergence rate 
of H m is Op(v / m) and the condition (14.81) is not satisfied. Zhang, Hu and Cheung 
(2006) established the asymptotic properties of SEU models. 

Example 4.2 Bai, Hu and Shen (2002) proposed a GUP to allocate subjects, in which 
the urn is updated in the following way: at the mth stage, if a subject is assigned 
to treatment k and cured, then a type k ball is added to the urn. If a failure, then 
^ Pm z}' J — balls of each type j ^ k are added. Where p m -\ j = (S m -i ? + l) / (N m -i i + l), 
and S m -ij is the number of successes of treatment j in previous m — 1 stages. This 
model is a SEU model with H(x) = {hij(x); i, j = 1, . . . , K}, where h kk (x) = p k and 
h kj (x) = qkXjlY.i^k x i k )- 



More examples and applications of SEU models can be found in Zhang, Hu and 
Chueng (2006), in which how to defined a SEU model by using the information of 
distribution parameters to target a pre-specified limiting allocation proportion is dis- 
cussed in details. 



5 Drop-the-loser rule 

The asymptotic normality for the urn models can be obtained only when the condition 
A < 1/2 is satisfied. This is a very strict condition. Even in the case of K = 3, it 
is hard to be satisfied and to check it is not a easy work. Also, when A is close 
or exceeds 1/2, the variability of an urn model is extremely high. Ivanova (2003) 
proposed a drop-the-loser (DL) rule which has the same limiting proportion as Wei 
(1979)'s rule (See Example 14.11) but has much smaller variability. Consider an urn 
containing balls of K + 1 types, type 0, 1, . . . K, when comparing K treatments. A 
ball is drawn at random. If it is type k, k = 1, . . . , K, the corresponding treatment is 
assigned and the subject's response is observed. If the response is a success, the ball is 
replaced and the urn remains unchanged. If a failure, the ball is not replaced. When 
a type ball is drawn, no subject is treated, and the ball is return to the urn together 
with one ball of each type k, k = 1,...,K. Ivanova (2003, 2006) established the 
asymptotic normality after embedding the urn process to a death-and-immigration 
process. Here we give the strong approximation. 

Theorem 5.1 There is a K -dimensional standard Brownian motion W(t) such that 
N n — nv = W(n)diag(^^, . . . , J^^) (I ~ + o{n l ' 2 ~ R ) a.s., 

for some k > ; where Vk = (l/qk)/ Ylj=i(l / Qj) > k — 1, . . . , K . Hence 

Vn-(N n /n - v )Zn(0,(I- vl')diag(^, . . . , ^) (I - l'vj) . 

\ qi qx / 

In particular, in the two-treatment case, y/n{N n /n — v) — > N{Q,a 2 DL ) with a 2 DL = 
Qi&ipi + P2)/(Qi + ffe) 3 , the same as the o~ 2 PW . For the generalizations of the DL rule 
and their applications, one can refer to Zhang, Chan, Cheung and Hu (2007), Sun, 
Cheung and Zhang (2007). 

Proof of the theorem. Let Z m = (Z m> o, . . . , Z m ^) be the urn compositions after 
the m-the assignment. And let \x m be the number of draws of type balls between 
the (m — l)-th assigment and the m-th assignment. Remember that when a type 
ball is drawn, we add one ball of each treatment type, and when a treatment type 
ball is drawn, it is replaced only when the response is a success. So 

Zm,k ^m— l,fe l^m -^m,k^m,k 1) /^m -^-m,kqk -^-m,k^m,k Pk)i 

where ^ m ,k = 1 if the response of the m-th subject on treatment A; is a success, and 
if failure. Let M njfc = YZi=\ X m,k(^m,k ~ Pk), M n = (M n>ll . . . , M njK ), and A m be the 



sigma field generated by £i jfe , • • • , £ m>fc , k = 1, . . . , K, and X u . . . , X m , X m+1 . Then 
{M n ,A m } is a martingale. It follows that Z n>k - Z 0fi = YT m =\Pm ~ N n>k q k + M mM . 
We can prove that Z n ^ k = o(n 1,/2_<50 ) a.s. for some 5q > 0. For details of the proof, 
we refer to Zhang, Hu, Chueng and Chan (2006b), Sun, Cheung and Zhang (2007). 
Hence 

n 

N n ,k = Vm/Qk + M m , k /q k + o{n 1/2 - So ) a.s., k = 1, . . . , K, 

m=l 

which, together with the fact iV n) i + • • • + N Ht x = n, implies 

N n — nv = M n diag(l/qi, ±/q K ){I - l'v) + o(n 1/2_5 °) a.s. 

So, N n — nv = 0{y/n log log n) a.s. by the LIL (Theorem B). On the other hand, for 
the martingale {M n } we have 

n 

E[(AM„)'AM n |7 m _ 1 ] = diag(N ntlPiqi , N n , K p k q k ) 

m=l 

= n diagtytfiqx, v K p k q k ) + 0(y/n log logra) a.s. 

By applying the strong approximation (Theorem E), we can define a i^-dimensional 
Brownian motion W(t) such that 

M n = W(n)diag(y/v^p^, ^v K p k q k ) + o(n 1/2 ~ K ) a.s. 

The proof is now completed. □ 



6 Doubly adaptive biased coin designs 

The PW rule and urn model designs are a kind of design-driven adaptive designs, 
which are constructed with intuitive motivation. However, clinical trials are usual 
complex experiments on humans with multiple, often competing, objectives, includ- 
ing maximizing power to detect clinically relevant differences in treatment outcomes, 
maximizing the individual patient's personal experience while treated in the trial, and 
minimizing the total monetary cost of trial. These and other objectives can be defined 
in terms of optimization of function of the trial's parameters, the optimal allocation 
proportion is often a function of unknown parameters. Take a binary response clinical 
trail with two treatments 1 and 2 as an example. The well known Neyman proportion 
is 

P{PuP2) =■ ; = . , — 

where p k (q k ) is the probability of success (failure) of a trial treatment k, n k is the 
number of subjects assigned to treatment k, k = 1, 2, The Neyman proportion maxi- 
mizes the power of a test of the simple difference p\ — p 2 for fixed sample size n. But 
if we implement Neyman allocation, when p\ +p 2 > 1, we will assign more subjects to 
the inferior treatment, which will compromise the ethical objective. Rosenberger, et al 



(2001) discussed another important optimization criteria that minimize the expected 
number of treatment failures, n\qi + n 2 q 2 , for fixed the variance, p\q\jri\ + p 2 q 2 /n 2 , of 
the statistic pi — p 2 under an alternative hypothesis p\ 7^ p 2 . This leads the optimal 
proportion as follows. 

P(P1,P2) = ^ 

y/Pl + VP? 

Many simulation studies have validated that a adaptive design with this proportion 
as its target performs very satisfactorily for both the consideration of ethic and the 
test of power. For other optimization criteria one can refer to Jennison and Turnbull 
(2000) and Rosenberger, et al. (2001). Both Neyman allocation and the allocation of 
Rosenberger,et al (2001) cannot be implemented directly in a clinical trail, because 
we do not know the values of pi and p 2 . In this section, we introduce an important 
class of adaptive designs, doubly adaptive biased coin designs (DBCD), which is first 
proposed by Eisele (1994) and Eisele and Woodroofe (1995), to target a pre-specified 
allocation proportion. 

Consider a clinical with K treatments. The outcome of a subject on treatment 
k has a distribution fk(-\0k)- Write 6 = (61, . . . ,9 K ). The pre-specified allocation 
proportion is v = p{6) = (pi(#), . . . , p m {6)). Here p{y) is assumed to be continuous 
function on the parameter space, taking the values on (0, l)® k and twice differentiable 
at the true value of the parameter 6. A multi-treatment DBCD proposed by Hu and 
Zhang (2004a) is defined as follows. 

To start, allocate M subjects to each treatment. At stage m, suppose m — 1(> 
MK) subjects are allocated and the outcomes, N m -\^ outcomes of treatment k, 
k = 1, . . . , K, are observed. Let Qm-i^ be the MLE of the parameter 9k, k = 1, . . . , K. 

Write Q m -\ = (dm-1,1, " " " ,@m-i,K), and let p m -i = p(#m-i) be the current estimate 
of the target allocation proportion. Now, the m-th subject is allocated to treatment 
k with a probability: 

N _i ^ 

P m ,k ='■ P(X mtk = 1|JF m -i) = Qk{ ^— T , Pm-l) , (6.1) 

TXX X 

k = 1,...,K. Here g(x, y) = ( gi (x, y),..., g K {x, y)) : (0, l)® 2k -> (0, lf k is the 
allocation function. Write P rn = (P m ,i, ■ ■ ■ , P m ,K)- 

Theorem 6.1 Suppose the distributions . . . , fxi^R) follow an exponential 

family. Let g(x, y) be defined as 

g k (x,y)= v v / , T , k=\,...,K- 7 >0. (6.2) 



Then 



N n — nv = 0(^n\og\ogn) a.s. and y/n(N n /n — v) — > N(0, S), 



where 

S = S p + — — -(diag(v) - v'v + S p ), 
1 + ^7 



?£) diag({v 1 I 1 {0 1 ))- 1 ,...,v K I 1 (0 K ))- 1 ) % 



and Ik(Ok) is the Fisher information function for a single observation on treatment k. 

In particular, for the two-treatment case, suppose the targeted allocation proportion 
of treatment 1 is p = p(9%, 9 2 ). Then y/n(N nt i/n — p) —* N(0, o~ 2 DBCD ) where 

°DBCD = ° 2 P + - P) + a l) 

and a 2 p = (I 1 {9 l )p)-\dp/de 1 f + (/ 2 (0 2 )(1 - p)y\dp/d6 2 y. 

Example 6.1 Consider the binary response clinical trail with two treatments. For 
the urn proportion p = (72/(91 + Q2), 

2 = giggCgi +P2) 2gig 2 

® DBCD ( ft + fe )S + ( l + 2 7 )(g 1 + g2 )3- 

For £/ie Neyman proportion p = ^/pm/(VPiQi + y/P^), o- 2 DBCD = 

ypigiP2g2 



(I+27) ( s /pl qi+^/p^) 



1+7 / P2g2(qi-pi) 2 _i_ pigi(g2-P2) 2 

2(l+27)(^/pTgT+^/p5g5') 3 I VPTgr v 7 ?^ 



For Rosenberger, et al's proportion p = >/pi/(v/pi + 1/P2); Vdbcd 



VP1P2 _^ 1 + 7 / P2gi Pi<?2 



(1 + 2 7 )( V ^T+ V ^) 2 2(1 + 2 7 )( V ^T+ V ^) 3 Vv^T 



From Theorem 16. 11 we find that the asymptotic variability is a decreasing function 
of parameter 7. However, the degree of randomness of the design decreases when 7 
increases, because, as the value of 7 becomes larger, the allocation probabilities shift 
faster to extreme values and 1 if there is a bias between the current sample allocation 
and the estimated target. When 7 = 00, the variability of the procedure is minimized, 
but the procedure is completely predictable. The parameter 7 can be chosen to reflect 
the trade-off between the degree of randomness and the variability. 

The allocation function defined in (16. 2p is very special though it has fine properties. 
For results for general allocation function g{-,-), one can refer to Hu and Zhang 
(2004a). 

Proof the Theorem. Let £ m &, m — 1, 2, . . ., be i.i.d. random variables, which 
represent the outcomes on treatment k, k = 1, . . . , K. In clinical trial, only X m ^ m ,kS 
are observed. Write £ m = (£ m ,i, . . . ,£ m ,k)- F° r simplifying the proof, we assume that 
9k = E£ m) fc is the mean of the outcomes, and so we use the sample mean as it estimate: 

V m ,k — T-j , ft- — 1, . . . , -TV. 

^m,k 



In practices, if necessary, we can add a > in the numerator and (3 > in the 
denominator to avoid the nonsense case of 0/0, or to use prior information to estimate 
the parameters. Assume E|£ mj fc| 2+<5 < oo and write a\ = Var{£ mj fc}. Write Q m ,k = 
YjT=x x m,k{im,k - Ok), then Q m)k is a martingale and 9 m>k - 9 k = Q m ,k/N mM . By the 
LIL (Theorem B), we have 

Qm,k = 0(^Jn log \ogn) a.s. (6.3) 

We first show the consistency of N n /n. If let A m = cr(X 1; . . . , X m+1 , £ 1; . . . , £ m ), 
then YZi E[(X iik (^ k - e k )f\Ai-i] = v 2 k N m ,k- By Theorem A (b), it follows that 

m ,k — > Ok cts. on the event {N mjk — > oo}. (6.4) 

On the event {N mjk < oo}, Q m ,k will fix to a value eventually. In either case, 9 rriyk 
has a limit 9 k in the parameter space, k = 1, • • • ,K. By the continuity of p{-), 
Pm~i — > p{9\, ■ ■ ■ , 9k) '■= v G (0, 1)® K a.s. Notice that the minimum of Y^j Uj(yj/ X j)' y 
over J2j x j = 1 anci x j > is £\ Vy lt is easily seen that g k (x, y) < y fe (t/fc/x fc ) 7 < y k 
if x k > y k and J2j x j = J2jVj = l - So > p ™,k < Pm-i,k if N m _ 1>k /(m - 1) > p m -i,k- 
Denote M nM = J2 m =i( X m,k - E[X m ,k\ Fm-i])- Let S n = max{m > MK+1 : iV m _ 1)fc < 
(m — l)p m _i 5 fc} and max{0} = Mif. Then 

n n 

N n>k = N Sn , k + (A mjfc — E[X m)fc |jF m _i]) + P m) fc 

m=S„+l m=5 n +l 

<1 + N Sn -i, k + M„j, - M Sn , k + Yl P m > k 

m=S„ 

n-1 

<1 + N MK _ hk + (M ntk - M Sn , k ) + (S n - l)p Sn -x, k + P™> k ' ( 6 - 5 ) 



Notice that \M nyk — Ms n)k \ < max m < n |M mi fe| = 0(\?n log logn) a.s. by the LIL, and 
Pm,k — > v k . We conclude that limsup,^^ N n ^ k /n < v k a.s., k — 1, . . . , K. From the 
fact that ^2 k=1 Nn tk /n = J2k=i^k = 1, we conclude that N n ^ k /n — > v k G (0,1) a.s., 
which implies that N n)k — > oo, k — 1, . . . , K. Hence 9 k and 9 k (v and v) must be iden- 
tical by (16.41) . We have proved the consistency of N n /n. Further, according to (16. 3|) . 
we have 6 m — 6 = O ( A/log log m / y/m) a.s., and then p m — v = O (Vlog log mj \[m) 
a.s., which together with (16.51) . yields 

N n)k - nv k < 0(^n log logn) + 0( v / 5'„log \ogS n ) 

n-1 

+ 0( K ^/\og\ogm/ \/m) = 0(a/ n log logn) a.s. 

m=S n 

By the fact that J2k=i ^n,k/ n — Y^=i^k — 1 again, we conclude that 



iV„ — nv = 0(^/n log log n) a.s. 



(6.6) 



Now, we begin the proof of the asymptotic normality. Write M n = (M n l , . . . , M n ^ K ) 
and Q n = (Qn,i, • • • , Qu,k)- Then by 06.31) and f!6.6p . 

Q m ,k , „ /log log 



m ,k -9k = + O a.s. 

mvk \ m J 

It is easily seen that dg/dx\ x=y = —7(1 — l'v) and dg/dy\ x=y = (7 + 1)(/ — l'v). 
By the Taylor formula, we have 

P m - V = p( — g=i p m -l) ~ P(V, v) 

m — 1 



0(11^ -HD + dl^-i-^ll 2 ) 



m — 1 




N 


-1 


Jl m _ 


1 






m — 1 









<9p ^ /log log m 



90 V m 



,iV m _i . Q m _i /l^P . n f^oglogm\ 

- J{ - v) + (7 + 1 h -r-tZza^-)^ + O 

v m— 1 7 (m — 1) v 06 \ m J 



=0 

Hence 



log log m 



rn 



N n -nv = ^{X m - E[X m \T m -i\) + ^{P m - v) 

m=l m=l 

=M n - 7 £ +(7 + 1) E — + ^((log^) 2 ) a.*. 

m=l m=l 

On the other hand, it is easily checked that, for the martingale (M n , Q n ) we have 

n n 

E[(AM m )'(AM m )|J- m _ 1 ] = ^2(diag(P m ) - P' m P m ) 

m=l m=l 

= n(diag{v) — v'v) + 0(sj n log log n) a.s., 

n n 

^2 E[(AQ mjfc ) 2 | f m _i] = ^ p m,k a l = n Vk<y\ + 0(y/n log logn) a.s., 

m=l m=l 

E[(AM m )'(AQ m )|^ m _ 1 ] = and E[(AQ mjfc Q mJ |^ m _ 1 ] = 0, k ^ j. So, by the 
strong approximation (Theorem E), there are two independent K- dimensional stan- 
dard Brownian motions W(t) and B[t) such that for some k > 0, M n = W(n)S 1 + 
o(n 1 / 2 ~ K ) a.s. and Q n ,k = Bk(n)ak\Jvk + o(n l l 2 ~~ K ) a.s., k = 1,...,K, where Si = 
diag(v) — v'v. It follows that 

n-l _ r n-l \ 

iV™ — mi) , , x -» B(m) 



N n - nv =W(n)^ - 7 £ + (7+1) £ ~ S f 

+ o(n 1 / 2 ~ K ) a.s. 



m * — ' m 

m=l m=l 



Hence, N n — nv = G(n) + +o(n 1 / 2 K ) a.s., where 

G(t) = r 7 / x^dW^x)^ 2 + (7 + [ x^ 1 B(x)dx'E 1 J 2 

Jo Jo 

is the solution of the equation 

G(t) = W(t)nT - 7 f ^dx + (7 + 1) f ^MdxEf . 

Jo x Jo x 

It follows that ^Jn(N n /n - v) 4 A(0, S) with 

£ = Var{G(l)} 

=Si / x 27 cfe + (7 + l) 2 E p / / x^y^ix Ay)dxdy 
Jo Jo Jo 



27 + 1 27 + 1 

The proof is now completed. □ 



7 Delayed responses 

In practices, the outcomes in clinical trials are not available immediately prior to 
the treatment allocation of the next subject. The estimating of the parameters, and 
the updating of the urn when using urn models, can only be processed according to 
observed responses. The effect of the delay of treatment results is fist studied in theory 
by Bai, Hu and Rosenberger (2002) for the urn compositions in an urn model design 
with discrete responses. After that, Hu and Zhang (2004b), Zhang, Chan, Chueng 
and Hu (2007), Sun, Cheung and Zhang (2007) and Zhang, Hu, Chueng and Chan 
(2006a) have shown that the delay machine does not effect the asymptotic properties 
of the sample allocation proportions for many adaptive designs if the delay degree 
decays with a power rate. The basic reason is that the total delayed responses is 
a high order of square root of the sample size when the delay degree decays with a 
power rate. 

To describe the delay machine, we let t m be the entry time of the m-th subject, 
where t m is an increasing sequence of random variables. Assume that {t m+ i — t m } is 
a sequence of independent random variables. The response time of the m-th subject 
on treatment k is denoted by r m (k). Suppose {r m (k);m > 1} are sequence of inde- 
pendent random variables, k — 1, . . . , K. Further, assume that {t m+ \ —t m , r m (k); k = 
1, • • • , K, m > n} is independent of the assignments Xi, . . . , X n . 

Assumption 7.1 Let Sk(m,l) = I{r m (k) > t m+ i — t m } be an indicator function that 
takes the value 1 if the outcome of the m-th subject on treatment k occurs after at 
least another I subjects arrive, and otherwise. Suppose for some constants C > 
and 7 > 2, 



/ifc(m, I) = P{5k(m, n) — 1} < CI 7 , to, I — 1, 2, . . . , k — 1, . . . , K. 



This assumption is easily satisfied. A practical approach is to assume that the entry 
mechanism generates a Poisson process and the delay time has an exponential dis- 
tribution in which both {r m (k)} and {t m +i — t m } are sequences of i.i.d. exponential 
random variables with means A^ > and Ao > 0, respectively. This approach is 
common in clinical studies and the probability /j, k (m,l) is (Afc/(Ao + Afe)) . 

Let S^ s k (resp. N^ s k ) be the sum (resp. the number) of the outcomes on treatment 
k observed prior to the (m+l)-th assignment, and S m;k (resp. N m ^ k ) be the sum (resp. 
the number) of all the outcomes of those being assigned to treatment k in the first m 
subjects, k = 1, . . . , K. 

Theorem 7.1 Suppose Assumptions \ 7. 1\ is satisfied, and the responses on each treat- 
ment are i.i.d. random variables having finite (2 + 5)-th moments. Then for some 
< 5 < \ — 2^5, we have 

R n ,k=-S m ,k-S%% = o(n l l 2 ~ 5 ») a.s. (7.1) 

and N nyk - = o(n l ' 2 - 5 °) a.s., k—l,...,K. 

Proof. Let Ik(m,l) be the indicator function, which takes value 1 if the out- 
come £ m) fc on treatment k of the m-th subject occurs after the (m + /)-th assign- 
ment and before the (m + I + l)-th assignment, k = 1, . . . , K. For given m and 
if /^(m, /) = 1, we observe a response X mjk ^ m>k . Hence, between the m-th assign- 
ment and the (m + l)-th assignment, the observed outcomes are ifc(m, 0)X m!fc £ m)/c , 
I k (m-l, l)X m _i )fc £ m _i jjfc , . . ., I k (l,m-l)X ltk £ ltk , k = 1, . . . ,K, and the sum of those 
on treatment k is X^o* J fe( m ~ l)X m -i,k£m-i,k = Ya=i h{h m ~ M,fc6,fc- Hence, 

n m n n 

S t% = ^2^2 Jfe ^' m ~ l ) X i,kHl,k = ^2^2 X m,k£ m ,klk(m,j - m). 

m=l 1=1 m=l j=m 

On the other hand, it is obvious that S m;k = YZi=i X m,k£m,k = YZv=i Yj% m X m,k£,m,kh{™>, r 
m). It follows that 

n 00 

Rn,k = 22 X] X m,kim,kh{m, j - ITl). 
m=l j=n+l 

Let O<0<l/2bea number whose value will be specified later. Write l n = [n^]. 
Then 

n n—l n 00 

m=n—l n +l m=l j=n—m+l 
n n 

< ^2 X m,k\jm,k\ + 8 k (m, l m ) X m , k \^ m ,k \ 

m=n—l n +l m=l 

n n 

<Cn*+ (IWI - E tlWl]) + J2 Xm ' kE ^ 6k ^ lm ^ m ^ 

m=n—l n +l m=l 

n 

+ ^ X mjk (6 k (m, l m )\£m,k\ ~ E[5fc(m,/ m )|^ m , fe |]). (7.2) 

m=l 



F° r El=i (l£m,fc| ~~ E[|^ mfc |]), due to Theorems 1.2.1 and 2.6.6 of Csorgo and Revesz 
(1981), 

n 

(I Wl - E tl WO) =0(y/l n logn) + o{n^) 



m=n—l n +l 



--o(n <t ') + o(n 2 +0 a.s. 



For the martingale J2m=x X m,k (4(wM m )| £ m>k \ ~ E[6 k (m, l m )\C m , k \}) , notice 

E\X m;k (S k (m, l m )\€m,k\ ~ E[5 fc (m, Z m )|£ m ,fc[ 
<E(5 fc (m,/ m )|e m , fc |) 2 < (E<5 fc (m,/ m ))^(E|e m , fe | 2+5 )^ <m-^. 



2 07(5 



due to Assumptions 17.11 and the assumption of finite (2 + 8)-th moments. So, by the 

LLN (Theorem A (a)), Ylm=i X rn, k {h(mJ m )\^ m:k \-E[5 k (mJ m )\C m , k \}) = o{n~^+ s ) logn) 
a.s. Also 



< (7 m 2 +« < Cn 2 +s . 



J2 X m,kE[S k {m,l m )\D m>k \) < J2 (E5 k (mJ m ))^ (E\D m , k \ 2+s )^ 

m=l m=l 

±YQ±6} _ i 07(1+*) 

^ ; , i . ^ ' /'.,-' ■ > i * 

m=l 

Combining the above arguments yields 

jr -, <p~/(l+S) i 1 07.5 

_R njfc = O(n ) + 0(n 2+i ) 4- 0(722+5) + (n 2 ^+^\ogn) a.s. 

Choosing <p = 1+ j\^ h and < 5 < min{± - 0, § - ^Jj} yields (EE]). The 
proof of iV nifc — A^"° 6 | = o(n 1 / 2_5 °) a.s. is similar. □ 

8 Variability, power and asymptotic best adaptive 
designs 

The variability of the sample proportion N n ^ k /n is an important quantity, which mea- 
sures the distance between N n>k /n and its limit v k . The smaller is the variability, 
the smaller is the probability that there is large bias between N n>k /n and v k . When 
using a adaptive design with high variability, a clinical might result in assigning more 
subjects to the inferior treatment making the allocation even less ethical than equal 
allocation. Also, a trial with high variability might result in assigning only a few 
subjects to one of the the treatments decreasing the efficiency in the test or the esti- 
mation of parameters. According to ( 14. ip . the asymptotic variability of the RPW rule 
is very high unless both treatments have low success rates. It is extremely high when 
qi + q2 is close to 1/2, and it is showed that n\/ar{N ni i/n} — > oo when q% + q 2 < 1/2. 
The RPW rule used in ECMO, 1985, trial assigned only one patient to the less suc- 
cessful control therapy (see Royall, 1991, for discussion). The relationship among the 



power, the target allocation and the variability of the designs is fist revealed by Hu 
and Rosenberger (2003) in theory, though simulation studies had indicated there is 
strong relationship among these quantities. Hu and Rosenberger (2003) proved that 
the average power of a statistical test of the difference of distribution parameters is 
a decreasing function of the variability of the designs. In Section [61 we have found 
that the asymptotic variability of DBCD is a decreasing function of the parameter 7. 
When 7 — * 00, the variability tends to its minimums 

^p=(%) diag((v 1 I 1 (9 1 ))-\...,v K I 1 (e K )r 1 )^. 

Hu, Rosenberger and Zhang (2006) proved that this limit is the lower bound of the 
asymptotic variability of adaptive designs among all adaptive designs which have the 
same limiting proportion. 

Assume the following regularity conditions: 

1. The parameter space ©& is an open subset in & d , j = 1, . . . , K\ 

2. The distributions of outcomes . . ., /#•(• \Qk) follow an exponential fam- 

iiy; 

3. For the limiting allocation proportion p{0) = . . . , Pk{0) £ (0, 1)® K , 

— ^ -»• Pj{0) as- j = l,...,K; 



4. For a positive definite matrix V(0), 



Theorem 8.1 Under regularity conditions 1-4, there exists a @o C = @i <8> ■ ■ ■ <8> 
®k with Lebesgue measure such that for every 6 G — ©o, 

where 1(6) = diag(px(0)Ii(0x), . . . , Pk{9)Ik{Qk)) and Ik(Ok) is the Fisher informa- 
tion for a single observation on treatment k — 1, . . . , K. 

We refer to a adaptive design that attains the lower bound as asymptotic best for 
that particular allocation p(6). Table [1] gives the lower bounds of the asymptotic 
variabilitites (c 2 LB ) for urn proportion (UP), Roenberger, et al's proportion (RP) 
and Neyman proportion (NP) in a binary response clinical trial for two-treatments. 
It is interesting to note that the Zelen (1969) 's deterministic design (PW rule) is 
asymptotic best among all procedures with limiting allocation proportion 52/(91 + 92)) 
the urn proportion. The RPW rule has the same limiting proportion, but it is not 
asymptotic best because o~ 2 RPW > a\ B . The DL rule proposed by Ivanova (2003) is 





P 




UP 
RP 
NP 


92 
<?l+<32 


<?ll?2(Pl+P2) 
1 f P2<?1 1 P192 \ 

4(v^r+v^2") 3 V v 7 ^ v 7 ^/ 
1 ( P2Q2{l-'2pi) 2 I pigi(l-2p 2 ) 2N \ 




4 ( V / PT9r+V / P292) 3 V V / Pl9l \/P292 / 



Table 1: Lower bounds of the asymptotic variabilities for urn proportion, Rosenberger 
et al's proportion and Neyman proportion. 

a random procedure having the same urn proportion and the asymptotic variability 
°dl = QiOzipi +£>2)/(<?i + (fe) 3 - So, the DL rule is asymptotic best. However, both 
the RPW rule and the DL rule can only target this particular proportion, which is 
not optimal in any formal sense, and can only be used for binary responses (c.f., Hu 
and Rosenberger, 2003). 

DBCD is also not asymptotic best (except 7 = 00). But, it can target any desired 
allocation and can be used for general responses, for example, continuous response. 
Zhang, Hu and Cheung (2006) proposed an urn model, the SEU model, which can 
target any pre-specified allocation proportion and can be used for general responses. 
The drop-the-loser rule has also been generalized to GDL model, a kind of urn model 
with immigration, by Zhang, Chan, Cheung and Hu (2007) and Sun, Chueng and 
Zhang (2007), by using the estimators of unknown distribution parameters, such that 
it can target any pre-specified allocation proportion and can be used for general re- 
sponses. For a general pre-specified allocation proportion p = p(6) in a A'-treatment 
trial, the asymptotic variance-covariance matrices of SEU, GDL and DBCD are given 
in Table [2j Among these models, the DBCD can approach the lower bound for large 



Model 


SEU 


GDL 


DBCD 


Variability 


diag(p) - p' p + 6T, LB 


2£lb 


diag{p)-p'p 2+27 v, 
I+27 1 I+27 ^ LB 



Table 2: The asymptotic variability of SEU, GDL and DBCD for a same limiting 
allocation 

values of 7. However, the procedure becomes more deterministic as 7 becomes larger, 
and hence careful tuning of 7 must be done to counter the trade-off between the ran- 
domness and variability. The use of DBCD with 7 = 2 was strongly recommended in 
Hu and Rosenberer (2003) for binary response trials with two treatments, according 
to the simulation study. 

Very recently, we have found a fully randomized biased coin design (RBCD) for 
two-treatment clinical trails, a kind of DBCD, which preserves randomization, attains 
the lower bound, and can target any allocation. Whether or not an urn model can 
be defined to have these properties is still an open problem. In the RBCD, instead of 
using a continuous allocation function g(x, y), we use a discrete function: g(x, y) = ay 
if x > y, y if x = y and 1 — a(l — y) if x < y, where < a < 1. That is, the m-th 



subject is allocated to treatment 1 with a probability 




(ap(d m ^), UN, 

<p(0 m -i), ifJV, 
[1-0(1-^(^1)), ifiV, 



171- 



171- 



771- 



1>1 /(m - 1) > p(6, 
i,i/(m - 1) = p(0, 
i,i/(m - 1) < p(0, 



m 



in- 



711- 



l). 



1), 



When p{0) = 1/2, the RBCD is just the Efron (1971) 's biased coin design. The 
following theorem gives the asymptotic results for the RBCD, the proof of which will 
not be presented here. 

Theorem 8.2 Suppose p(-,-) is a continuous function on the parameter space and 
twice differentiable at the true value of the parameter = (0 1 ,9 2 ), the distributions 
/i(-|#i) and y*2 (■ I ^2) of the responses follow an exponential family. Then N n ^/n — 

p(9) = 0(^\oglogn/n) a.s. and \fn{N n> \/n — p(0)) iV(0, cr^), where 



Further, max m < n |iV ml — mp(0) — o~ p W(m)\ = o{y/n) in probability, where W(t) is a 
standard Brownian motion. 

9 Discussion 

In this paper, we have discussed several classes of adaptive designs. The play-the- 
winner rule is the simplest procedure and has small variability. But is is too determin- 
istic to be used in clinical trials. The randomized play-the-winner rule and urn models 
are random procedures. But their variabilities are very high. Theoretical results, sim- 
ulation studies and a real example in ECMO trial all indicate that the statistical test 
in using a adaptive design with high variability is not powerful. However, besides in 
adaptive designs, urn models have wide applications in many areas including biolog- 
ical science, random algorithm and sampling, information science, etc. And the urn 
models have strong relationship with multi-type branching processes. The study of 
urn models has been of interest in a long history. The DL rule is randomized pro- 
cedure and has the smallest variability among all the adaptive designs with limiting 
allocation proportion q 2 /(qi + 12)- But it can only target this particular proportion 
and can only be used for binary responses. When it is generalized to be able to tar- 
get any pre-specified allocation, the variability is no longer the smallest (c.f. Table 
[2]). Among the adaptive designs mentioned in this paper, the DBCD and RBCD are 
the only procedures that preserves randomization, attains or can approach the lower 
bound of the variability, can target any allocation and can be used for general discrete 
or continuous responses. 

The examples considered in this paper are binary response clinical trials. In prac- 
tices, the responses of clinical trials appear in various types. For more examples and 
discussion, we refer to a new book of Hu and Rosneberger (2006). In many clinical 





(d P (0)/de 1 ) 2 (p^/^or 1 + (dp(0)/de 2 ) 2 ((1 - P {0))h{e 2 )) 



-1 



trials, covariate information is available that has a strong influence on the responses 
of patients. For instance, the efficacy of a hypertensive drug is related to a patient's 
initial blood pressure and cholesterol level, whereas the effectiveness of a cancer treat- 
ment may depend on whether the patient is a smoker or a non-smoker. The theory of 
an adaptive design in using covariate information is much more complicated than those 
without covarites. A limit success in deriving the asymptotic properties of covariate- 
adjusted adaptive designs has been achieved by Zhang, Hu Cheung and Chan (2007). 
The power study and evaluation of the covariate- adjusted adaptive designs are our 
future studies. Also, in many clinical trials, the observed responses are usual survival 
data. Though it has been shown that the delay of treatment results does not effect 
the asymptotic properties in many adaptive designs, is is assume that the delayed 
responses are finally observed if the time is long enough. It is an interesting topic of 
studying the properties of the adaptive designs with missed or censored data. 

Finally, in this paper, we only present the asymptotic results. It is important 
to check the accuracy of the asymptotic approximations when using the theoretical 
results to evaluate or compare designs. Simulations have indicated that in most cases 
these designs closely approximate asymptotic results for a moderate sample size of 
n = 100. 
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